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ABSTRACT 

We consider the optimum depth of a cluster survey selected using the Sunyaev- 
Zel'dovich effect. By using simple models for the evolution of the cluster mass function 
and detailed modeling for a variety of observational techniques, we show that the op- 
timum survey yield is achieved when the average size of the clusters selected is close to 
the size of the telescope beam. For a total power measurement, we compute the opti- 
mum noise threshold per beam as a function of the beam size and then discuss how our 
results can be used in more general situations. As a by-product we gain some insight 
into what is the most advantageous instrumental set-up. In the case of beam switch- 
ing observations one is not severely limited if one manages to set the noise threshold 
close to the point which corresponds to the optimum yield. By defining a particular 
reference configuration, we show how our results can be applied to interferometer ob- 
servations. Considering a variety of alternative scenarios, we discuss how robust our 
conclusions are to modifications in the cluster model and cosmological parameters. 
The precise optimum is particularly sensitive to the amplitude of fluctuations and the 
profile of the gas in the cluster. 
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1 INTRODUCTION 

The Sunyaev-Zel'dovich (SZ) effect is due to the re- 
scattering of cosmic microwave background (CMB) photons 
■ by the hot electrons situated in intracluster media (Sun- 
' yaev & Zel'dovich 1972). It has been realized that it should 
provide a reliable mechanism for selecting a sample of clus- 
ters (Haiman, Mohr & Holder 2001; Holder, Haiman & Mohr 
2001) and a number of instruments sensitive to the mi- 
crowave frequency range (15GHz-350GHz) are under con- 
struction to perform this task (Kneissl et al. 2001; Holder 
et al. 2000; Lo et al. 2002; Mauskopf et al. 2000; Browne 
et al. 2000; Mohr et al. 2002; Kosowsky 2003). Since the 
cluster mass function is sensitive to cosmological parame- 
ters (Thomas & Carlberg 1989; Oukbir & Blanchard 1992; 
Scaramella, Cen & Ostriker 1993; Barbosa et al. 1996; Viana 
& Liddle 1996; Eke, Cole & Frenk 1996) such surveys are 
likely to be an important part of the program to probe the 
late-time history of the universe, searching for the origin of 
the apparently accelerating expansion of the Universe. 

The number of clusters as a function of redshift and 
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flux is a strong function of 57 m , the mass density of the uni- 
verse in units of the critical density, and as, the normaliza- 
tion of the matter power spectrum on scales of 8/i _1 Mpc, 
and is a weaker function of the Hubble constant Ho = 
100/ikmsec _1 Mpc _1 , the equation of state u>o of the dark 
energy, Q, and its evolution wi. Here, wo and wi are defined 
by -Pq/pq = w = wo + wiz. It has been shown that the de- 
generacies within the space of cosmological parameters are 
such that cluster surveys are complementary to those which 
might be obtained using observations of Type la Supernovae 
at high redshift (Weller, Battye & Kneissl 2002; Battye & 
Weller 2003). 

Since the distribution of the clusters obeys Poisson 
statistics the errorbars deduced on the cosmological param- 
eters will be very sensitive to the number of clusters found 
in a given survey. Therefore, we anticipate that finding the 
largest number of clusters possible for a given integration 
time will be close to giving the best constraints on cosmo- 
logical parameters, although this may also weakly depend on 
their distribution with redshift. Clearly a survey which will 
find ~ 1000 clusters will constrain cosmological parameters 
much better than one which finds ~ 100. Having a much 
larger sample would also seem desirable in terms of reduc- 
ing the systematic errors due to the poorly understood gas 
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physics. Fortunately, we will show that, by assuming we can 
model the gas physics well, one can get to within a factor of 
two of the optimal situation for a total power measurement. 
This is more tricky in more complicated experimental situa- 
tions such as beam switching whereby an ill-judged selection 
of the survey depth can lead to one finding very few clusters 
at all. 

The goal of this paper, therefore, is to discuss issues re- 
lated to optimizing the yield of a given survey; the optimum 
being defined as the survey in which the largest number of 
objects are found in a given amount of observing time. This 
is a question one is always faced with in any kind of astro- 
nomical measurement: how long does one integrate for on a 
given patch of sky? Obviously the shorter the required inte- 
gration time for individual patches, the more sky area can 
be covered. However, if one doesn't integrate long enough 
only the very brightest objects will be found. Clearly, one 
is always guided in such situations by one's expectations of 
what is there to be observed. 

For the present case of clusters, one can attempt to 
make a simple estimate by assuming (1) the clusters are 
point-like objects in the telescope beam, (2) their comov- 
ing number density is constant and (3) the Universe is Eu- 
clidean. In this case the number of objects with flux density, 
S, greater than the limiting flux density of a survey iSij m is 
N(S > Sum) oc AQ/Sf/^, where AQ is the angular coverage. 
Since for fixed total integration time Su m oc \j AQ, one can 
deduce that N oc AQ 1 ^ 4 oc Sf/^, and hence a survey with 
largest possible area (that is, the whole sky) would be opti- 
mal in this case. Under these assumptions the survey would 
be flux density limited and hence an instrument with the 
largest possible collecting area would be optimal in terms 
of making the largest number of detections in the shortest 
period of time. 

Unfortunately, in most cases clusters are unlikely to be 
point-like, nor is their comoving density likely to be con- 
stant due to evolution and, of course, the Universe is not 
Euclidean. As we will describe in section 2, the distribu- 
tion of clusters as a function of mass and redshift can be 
modelled in a sensible way for a given set of cosmological 
parameters. However, the key to understanding the issue at 
hand here is how the limiting mass of a survey is related 
to the instrumental noise and the angular resolution of the 
telescope. 

Shallow surveys, as suggested by the simple estimate 
above, will only find clusters which are very large and close 
by, and such clusters cover many beam areas. Since the same 
signal is spread over a number of beam areas, such objects 
can dip below the detection threshold even though they are 
very large. It is clear that this effect will compete with that 
due to Euclidean source counts in our initial estimate, lead- 
ing to an optimum at a particular value of the noise and 
angular coverage. We will show that this optimum will de- 
pend on the size of the beam: larger beam sizes, say 8', will 
require a deeper^ survey to achieve the optimum, whereas 
a much shallower survey will be optimal for much smaller 
beam sizes, say 1'. Essentially the optimal situation is for 
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Figure 1. The top row is the negative cluster profile for an obser- 
vation frequency of less than 217GHz. The bottom row are three 
hypothetical beam profiles: (a) the beamsize is much smaller than 
the cluster and the observation resolves it. In this case the obser- 
vation is surface brightness temperature limited, (b) the beamsize 
exactly fits the cluster profile. This is the optimal situation, (c) 
the beamsize is larger than the cluster profile. In this case the 
observation is flux density limited. 



most of the clusters in the survey to be the same size as the 
beam as explained in Fig. 1. 

Once the beam is sufficiently small to resolve all the 
clusters, the survey becomes surface brightness temperature 
limited, rather than being flux density limited, and increas- 
ing the collecting area no longer acts to increase the rate 
of detection. This illustrates an interesting feature of clus- 
ter survey, which is in contrast to intuition one might have 
from considering standard point sources and, as we shall see, 
leads to some other somewhat counter-intuitive conclusions. 

In section 3 we will discuss how to derive a relation 
for the limiting mass of the survey as a function of red- 
shift and the limiting flux, incorporating the effects of a 
Gaussian beam profile. Our initial motivation is to model 
idealized total power measurements using a single dish with 
a circular aperture. We will allow the observations to be 
smoothed using a window function which is exactly tuned 
to every possible cluster size above the maximum resolution 
of the telescope. Under these assumptions we show in sec- 
tion 4 that for a particular minimum beam size there exists a 
unique value of the noise which corresponds to the optimum 
in terms of the total of clusters detected, and for all beam 
sizes less than around 2' (FWHM) this is equivalent in terms 
of the number of clusters found in the survey, although the 
precise optimum noise will be different for different beam 
sizes. 

Accurate total power measurements are only feasible in 
space or at a very dry site (for example, the South Pole) 
on the earth's surface, mainly due to the effects of water 
vapour. In section 5 we discuss how our calculations can be 
adapted to situations closer to those experienced in reality. 
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In particular, wc will illustrate how our calculations can be 
applied to techniques which minimize the effects of the at- 
mospheric contamination which is ubiquitous at the relevant 
frequencies at most sites. These techniques involve either use 
of a beam switch or an interferometric array. For the most 
part our conclusions are similar to those for a total power 
experiment, although more care is required in interpreting 
them. 

Obviously our goal has some clear dangers since one is, 
by necessity, required to assume a specific distribution of 
clusters. In situations where the source population is com- 
pletely unknown, attempts at optimizing observations in 
this way are completely useless! However, the distribution 
and precise characteristics of clusters are becoming much 
better understood; something which is only likely to im- 
prove as observations progress. We have, therefore, used the 
so-called concordance cosmological parameters (Ostriker & 
Steinhardt 1995) and have attempted to model most of the 
relevant features of the distribution of clusters. In section 6 
we test the robustness of our predictions to variations in 
the cosmological parameters and in the cluster model. Small 
variations do not affect the optimum significantly but, not 
surprisingly, large variations substantially change our pre- 
dictions. We quantify this in terms of the parameters. Of 
particular concern is the uncertainty in the detailed mod- 
elling of the poorly understood gas physics within the clus- 
ter. It is clear that more sophisticated modelling of this 
physics would be required before we can have full confidence 
in our predicted optima; they should, nonetheless, provide 
a useful guide. 

For the purposes of this paper, we will ignore the effects 
of confusion due to radio and infra-red (IR) point sources, 
the CMB anisotropics and that due the distribution of clus- 
ters themselves which are below the flux limit. These will be 
discussed in a future publication. Suffice to say that the lim- 
iting flux densities and temperatures that we suggest here 
are compatible with confusion being negligible except for the 
most extreme cases. Very much deeper surveys run the risk 
of becoming confusion limited. We note that some of our 
conclusions concerning the most advantageous instrumental 
setup do not take into account possible strategies to mitigate 
problems which might occur due to confusion backgrounds. 

We should note that our optimization are only of rele- 
vance to ground-based experiments specifically designed to 
perform high resolution CMB measurements and SZ surveys 
at a single frequency. For general purpose, multi-frequency 
instruments such as PLANCK, the observational strategy 
(in the case of PLANCK, an all sky survey) has been de- 
signed to take into account a wide range of possible appli- 
cations, not just the ability to detect the SZ in clusters. 

Throughout this paper we will work in natural units 
where c = % = &b = 1. Standard conversion factors can be 
found, for example, in Kolb & Turner (1989). 



2 MODELLING THE CLUSTER 
DISTRIBUTION 

Except in section 6 where we test the robustness of our 
predictions, we will use a fiducial cosmological model 
which is a flat ACDM model with (fi m , as, h, wo, wi) = 
(0.3,0.9,0.72,-1,0). This model is compatible with re- 



cent CMB (Spergel et al. 2003), Large-Scale Structure 
(LSS) (Percival et al. 2001; Percival et al. 2002), Hubble 
constant (Freedman et al. 2001) and weak lensing measure- 
ments (Contaldi, Hoekstra & Lewis 2003). The precise value 
of ug is currently under close scrutiny and is very important 
in making predictions of cluster observations since the num- 
ber of objects found is extremely sensitive to its value. We 
should note that our chosen value is larger than some other 
values quoted in the literature (Allen, Schmidt & Fabian 
2002) and we will return to this issue in section 6. Where 
necessary we use a baryon density which satisfies current 
estimates (Spergel et al. 2003) and a spectral index of ini- 
tial density fluctuations ns = 1. Only very extreme changes 
from these two parameters can lead to significantly different 
results. 

We will use the mass function deduced from the suite 
of N-body simulations (Evrard et al. 2002) performed by 
the VIRGO collaboration. The comoving number density of 
objects at redshift z in the mass range M to M + dM is 
given by 



±- {z ,M) = -0.22^1^- 
dM y ' ; Ma M dM 



cxp{-|A( Z ,M)| 3 - 86 } , (1) 



where A(z,M) — 0.73 — \og[D(z)<TM], p m (to) is the matter 
density at the present day, am is the over density in a virial- 
ized region containing mass M and D(z) is the normalized 
growth factor which is dependent on cosmological param- 
eters. Here, M is defined to be M200, that inside a region 
with an overdensity of 200, see Battye & Weller (2003) for 
more details on how to apply this to cluster calculations. 

The clusters are assumed to be spherical and in hydro- 
dynamical equilibrium, in which case the virial mass, M v ; r , 
and virial radius, R v i T , are related by 



4-7T 

M vir = — p clit (z)A c (z)R 



3 

vir 1 



(2) 



where p C rit(z) is the critical density at redshift z and A c (z) 
is the relevant overdensity which can be computed from the 
spherical collapse model (Partridge & Peebles 1967; Gunn 
& Gott 1972; Lahav et al. 1991). This can be adapted to 
apply to models where w ^ — 1 (Wang & Steinhardt 1998; 
Battye & Weller 2003). 

We will assume that the electron density weighted tem- 
perature (T e )„ can be related to M v i r under the assumption 
that the cluster is isothermal via 



W-'h-^Mr. 
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[A c (z)E(zf 



-1/2 



l + (l + 3w) 



Qq(£) 
Ac(z) 



-3/2 



(3) 



where the Hubble parameter H{z) = HqE(z) and Q.q(z) 
is the density of the dark energy component relative to the 
critical density at redshift z. We will use T* w 1.6 as the nor- 
malization factor for most of our discussion, but will consider 
the effects of other values of T* in section 6. 

The profile of the hot gas (,(&), whose mass is related to 
the total mass of the cluster by M gas = / gas M v i r , is modelled 
using the isothermal /3-model (Cavaliere & Fusco-Femiano 
1976) with P = 2/3 as suggested by observations, 
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where 8 C is the apparent angular size of the cluster core, 
a = # v j r /#c and 6 V1V is related to _R v ir via the angular di- 
ameter distance <1a{z). We will assume that the gas frac- 
tion /g as is universal and redshift independent with value 
of 0.12 (Mohr, Mathiesen & Evrard 1999). Essentially, un- 
certainties in the global value of / gas can be absorbed into 
the mass-temperature (3), via modifications in the T*. Vari- 
ations in / gaa from cluster to cluster are more difficult to 
deal with. 

The total number of clusters with z < z max is computed 
using 



-/Vtot (Zn 



Afl 



dz 



dV 
dzdQ. 



dM 



M lim (z) 



dn 
dM ' 



(•») 



where AQ is the angular coverage and dV/(dzdQ) is the 
FRW comoving volume element. Mb m (z) is the limiting mass 
of the survey which one can relate to the limiting flux density 
5ii m as discussed in the section 3. We will use z ma x = 1.5 
since we are interested in clusters whose redshifts can be 
found using standard techniques. At present it is difficult 
to accurately deduce cluster redshifts which are ~ 1. We 
optimistically assume that, by the time these SZ surveys 
have been performed, techniques will have been developed 
to at least deduce photometric redshifts routinely out to 
z — 1.5. There are expected to be very few clusters with z > 
1.5. In principle our calculations could easily be modified to 
optimize the number of clusters between any two redshifts 
as desired. 



3 COMPUTING THE LIMITING MASS 

The flux density per beam due to the Sunyaev-Zel'dovich 
effect is given by 



s* = 2/ 2 r C MB/i(a0r, 



(6) 



where Tcmb is the CMB temperature, Y is the Compton y- 
distortion integrated over the beam profile and h(x) is given 
by 



h(x) = 



1) 



tanh(s/2) 



= -2 + 0(x), 



(7) 



where x = 27t//Tcmb ~ //56.8GHz, / is the frequency 
of observation and A is the corresponding wavelength. The 
limiting virial mass of the survey can be related to the flux 
density per beam (Battye & Weller 2003) by 



2f 2 TcMB(T e ) n a T h(x) / g a S M vir 



m a d\ 
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where 
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(9) 



or is the Thomson scattering cross-section, fi c ~ 1.143 is 
the mean molecular weight of the electrons and m e ,m p are 
the masses of the electron and the proton, respectively. The 
corresponding beam area is given by 



2xi0 1& 



s 

S 10 15 





' 1 


- y 




s 


- 


/ 

/ 


- 


/ 




/ 




/ 




- / 




- / 


- 


- / 




- / 




-/ s 




7 / 




V 


1 



Figure 2. The limiting mass for a family of surveys with / = 
30GHz and ^lim/^pwHM = 500/^Jy arcmin - 2 , that is, a family 
with the same system temperature, but different beam sizes. Each 
of the curves uses a beam of fixed resolution placed over the cen- 
tre of the cluster. The solid line has #fwhm = 1', the dotted 
line $fwhm = 2', the short-dashed line Spwhm = 4' and the 
long dashed line 0pwHM = 8'. The shape of the 8' beam ap- 
proximately follows that of ff'JJ' which is indicative of the point 
source approximation and with $fwhm = 1' it is more diffi- 
cult to detect clusters which are close by (Bartlett 2000), when 
compared to those at higher redshifts. Using the fiducial model 
jV to t(1.5) = 0.33,8.3 x 10~ 2 ,1.6 x 10~ 2 ,2.4 x 10" 3 per deg 2 for 
$FWHM = 1', 2', 4' and 8' respectively. Note that each of the 
curves tends to the same non-zero value as described in Appendix 
A. 



(10) 



fib = / dQ,B{9) . 



The corresponding thermodynamic temperature can be 
computed using the standard formula 



and 

9{x) = 



2f 2 h(x)Q h 



(11) 



(12) 



tanh(ar/2) 

is the frequency dependence of the SZ brightness tempera- 
ture. We note that this quantity is not the peak of the SZ 
effect due to the cluster, but the average integrated over a 
region whose size is set by the instrumental beam size. For 
the purposes of our discussion we will define a non-standard 
temperature unit which includes this frequency dependence, 
allowing us, where desirable, to quote frequency independent 
limits in the subsequent discussion, 

T _ r THERM _ r C MB(r c ) n q T / g a S Af vir j(b,q 



g(x) ii e m e m v d\ 

This can be related to the integrated y-distortion by Y — 
T/Tcmb ■ 

The function I(B, £) < fib takes into account the effects 
of the beam B(9) centred directly over the centre of the 
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In the absence of beam smoothing, much smaller beams give 
an altogether different shape since they find it more difficult 
to detect clusters which are very large, that is, those which 
are very close by. This led Bartlett (2000) to suggest that 
surveys performed with very high resolution instruments 
should be very deep, contrary to the argument based on 
Euclidean source counts we presented earlier. However, one 
must also attempt to take into account the effects of beam 
smoothing and be careful to make comparisons with con- 
figurations of the same temperature sensitivity, as we have 
done here, rather than flux density sensitivity which is re- 
lated to the overall collecting area. As we will see this leads 
to a conclusion somewhere in between these two extremes, 
strongly dependent on the resolution of the telescope. 

In a diffraction limited observation, the highest reso- 
lution beam size possible is #fwhm k X/D where D is 
the diameter of the telescope. Coarser resolution is possi- 
ble by either physically under-illuminating the dish using 
appropriate feed horns, or by beam smoothing the resul- 
tant map. Let us consider the latter possibility of an in- 



Figure 3. The solid line represents the limiting mass for = 
500/xJy, / = 30GHz and 0™^ HM = 1' computed using the al- 
gorithm discussed in the text allowing to take any value. In- 
cluded also are curves of fixed resolution with Sfwhm = 1' (dot- 
ted line), 2' (short dashed line), 4' (long dashed line), 8' (short 
dashed-dotted line), 16' (long dashed-dotted line). The limiting 
flux densities have been adjusted so that Sn m = 500/iJy, lmjy, 
2mjy, 4mjy and 8mjy respectively to model the effects of beam 
smoothing. Note that the solid curve represents the envelope of 
the others. 



cluster. We will take this to be a Gaussian with full-width 
half-max (FWHM) 6>fwhm, 



B(0) = exp 



2ol 



(14) 



where <Tb = #fwhm/\/8 log 2 and fib = 2ira h . It can be 
shown analytically (see Appendix A) that, under sensible 
assumptions for £(0), which apply to the isothermal /3-model 
and other possible cluster profile functions, the divergence 
in (8) at z — due to <Ia(z), is regularized by I. Hence, 
Miijn(z = 0) is non-zero; this is critical in the context of our 
subsequent discussion since it is this which mitigates against 
our earlier comments in the introduction on a shallow survey 
being optimal. 

Our aim is to use relation (8) to deduce the limiting 
mass as a function of redshift for a survey with noise quan- 
tified by limiting flux density Sum P er beam. In order to do 
this we first consider the simple case where we do not take 
into account of any kind of beam smoothing and assume 
the cluster to be 'detected' if the flux density in a beam 
which lies directly over its centre is greater than Sii m . The 
beam correction I is a complicated function of M v i r and z, 
but (8) can be inverted to give Mn m as function of z us- 
ing the Newton-Raphson method. We have computed this 
for / = 30GHz and the results are illustrated in Fig. 2 for 
a set of surveys which have constant $iim/#FWHMi that is, 
ones with the same brightness temperature sensitivity. We 
see that for large beams, #fwhm ~ 8', the shape of the curve 
reflects the dependence for small z of the limiting mass 
expected in the point source approximation (Bartlett 2000) . 



Nh copies of the minimum beam size, that is, 57b = iVbf?™ 111 . 
By adding the noise together for the Nb beams, we see that 
Tiim = TrSa" 1 / V-Wb where T^ 1 is the noise temperature cor- 
responding to the minimum beam size, and the noise level 
in terms of flux density increases to respect (If). Hence, 
the limiting flux density becomes Sn m = \J A^S/hiT where 
■Siim" is the flux density limit equivalent to T^ 1 . This would 
correspond to taking a map with resolution determined by 
Ofwhm an< A convolving it with a Gaussian window function 
with FWHM V^FWHM- 

A simple, and unrealistic, assumption is that one can 
smooth the resulting map to the size of any cluster larger 
than the minimum beam size which might contribute to it, 
that is, Nb can take any value > I adjusted to the size of 
the cluster in question. This represents the most one could 
hope to achieve from a particular observation. The number 
of beam areas covered by a cluster weighted by the gas profile 
function is given by 



N h = 



J ede ((e) 

f6d6B(8) 



(15) 



This can be used in conjunction with the Newton Raph- 
son method to compute the minimum limiting mass. This 
is shown in Fig. 3 as well as curves of fixed resolution with 
iVb = 1,4, 16, 256, which correspond to #fwhm = (2') fc and 
N h = 4 fe with k = 0, 1, 2, 3, 4. We have used Sg 1 ™ = 500/xJy, 
/ = 30GHz and 

flfwHM — !'■ The minimum limiting mass 
clearly represents the envelope of the family of curves with 
Nb taking any value between I and oo. This will be the ba- 
sis for our idealized total power measurements discussed in 
section 4 

One might be prevented from using this beam smooth- 
ing procedure to detect objects very much larger than the 
minimum resolution, that is, there exists a maximum beam 
size #fwhm arjove which there should be no beam smooth- 
ing. This might be due, for example, to instrumental base- 
lines. Moreover, in practice the value of Nb will be required 
to take a discrete set of values (we note that this may not 
necessarily correspond to Nb taking integer values). One 
simple procedure that one might use is to take 4 beam areas 
from the highest resolution map to form a map with half the 
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Figure 4. The solid line is the optimum limiting mass allowing 
for beam smoothing with any value of N^, for = 500/xJy, 

/ = 30GHz, 6^ X HM = oo and 6>^ HM = 1' and is the same 
as the solid line in Fig. 3. The other curves represent the ef- 
fects of imposing a finite value of #J?^f HM still allowing for any 
value of iVj, up to this limit. The dotted line has 9™{£)iM = 1' 
which is equivalent to the 1' curve in Fig. 3, the short-dashed 
line it is 2' and for the long-dashed line it is 4'. It is clear that 
extending the beam much above 4' will have little effect on the 
number of clusters found in the survey. Using the fiducial model 
iV t ot(l-5) = 0.33,0.57,0.67,0.70 per deg 2 for 0™ X HM = 1', 2', 4' 
and oo respectively. 



resolution; one which can then be iterated further to give an 
even lower resolution map. In this case N b = 4 k for some 
integer k. 

We investigate the effects of these two more realistic 
scenarios on the limiting mass in Figs. 4 and 5. In Fig. 4 
we reduce #fwhm while still allowing any real value for Nb, 
whereas in Fig. 5 we only allow Nb = 4 fe , while keeping 
6>fwhm = oo. We see that if #™whm > 4', the limiting mass 
and, hence, the number of objects found in the survey, is 
not significantly affected by a finite value of S'J^jjm- More- 
over, it is clear that the discreteness effects introduced by 
considering only values of iVb which are multiples of 4 are 
unlikely to have a particularly strong effect on the number 
of clusters found in the survey since the solid curve in Fig. 5 
is very close to that in Fig. 3. 



4 IDEALIZED TOTAL POWER 
MEASUREMENTS 

We will consider, initially, a simple single telescope observa- 
tion characterized by the system temperature, T sya , the re- 
ceiver bandwidth, A/, the number of simultaneous beams, 
rib, generated by a focal plane array, and the diameter of the 
telescope D. The expected 1 — o instantaneous sensitivity 
is Ti ns t « T S ys/ (??V A/) where r\ is the aperture efficiency 
of the telescope. In the rest of our discussion we shall use 



Figure 5. The solid line is the exact envelope of the dashed lines 
which are the same as the Sfwhm = l'j 2' and 4' from Fig. 3 
which was computed by allowing A^ = 4 fc as described in the 
text. Notice that it is similar to, but not the same as, the solid 
line in Fig. 3 which allows any value of Nb- It is unlikely that 
accurate inclusion of these effects of realistic beam smoothing 
will significantly alter the conclusions of this paper. 




Afwhm to refer to the minimum beam size 



X/D. 



0FWHkl[arcmin] 

Figure 6. Contours of the predicted number of clusters computed 
using (18) found per unit C as a function of $fwhm an d T Um 
per beam in the case of optimal beam smoothing. We have used 
2 max = 1.5. The contours represent 95%, 90%, 80%, 50% and 20% 
of the maximum number that could be found with particular fixed 
values of T sys , r], nb,t and A/. Notice that, for Sfwhm < 2' there 
is a degenerate line Ti; m = 4T CT ss 55^tK/(0pwHM/l') along which 
the survey would have optimal yield. 



The question which we would like to answer is how this 
sensitivity can be deployed most effectively. The extremes 
are to point the telescope at the the same piece of sky for 
the whole period of integration, or to cover the whole sky 
with a very shallow survey. Of course, we are likely to do 
neither, but the noise temperature per beam of any survey, 
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100 - 




Figure 7. Equivalent to Fig. 6 without allowing for beam smooth- 
ing. There is a localized maximum near #fwhm ~ 2' and 
Ti; m 30/jK. No degenerate line exists. We see that survey se- 
tups with 0fwhm < 1' find it difficult to find anything close to 
the optimal number of clusters, since they are unable to detect 
objects at low redshift without beam smoothing. 



Tcr, and the area covered, Afi, will satisfy 



4 log 2 



t #FWHM??\/nbiA/ 



0.94- 



fFWHM?7\/nbtA/ 



(16) 



where t is the integration time and we have assumed a Gaus- 
sian beam profile. Note that the temperatures, including the 
system temperatures, quoted here, unless explicitly stated 
otherwise, are expressed in terms of the sensitivity to the 
SZ effect (see the earlier discussion of these non-standard 
temperature units). 

This noise can be converted into the equivalent flux den- 
sity limit via the standard formula 



= 3, 



W i h{x) 



f 



GHz 



(17) 



We will choose the detection threshold to be 4 times the 
noise, that is, Tu m = 4T CT and Sn m = 45 CT . 

One can compute the number of objects for the family 
of surveys which satisfy (16) by substituting for Af2 from 
(16) into (5) to give 



N to t (Zn 



lim FWHM 



dz 



dV 
dzdQ, 



dM 



M Um («) 



dn 
dM ' 



where 



C = 



7T n h trj 2 Af 



64 log 2 



T 2 

J- ays 



0.07 



n h tAf 
T 2 



(18) 



(19) 



Here, Mn m (z) is now a function of Tij m and #fwhm- This 
expression shows the obvious dependencies on the parame- 
ters which we are taking to be constant T sy s, rib, t, f] and 
A/ : reducing T sys and increasing A/, t, rib, r\ will lead to 
an increase in the number of clusters found. 
The quantity 



Msd = 



n b r) 2 A/ n b r/ 2 A/ 



T 2 

J- sys 



^THERMj' 



[9(x)f , 



(20) 



can be considered as a figure of merit for a single dish instru- 
ment if #fwhm < 2'. Ignoring possible systematic effects it 
quantifies the number of objects an instrument will find for 
a fixed integration time, or conversely the time taken to find 
a fixed number of objects is oc Mg^- We see that once this 
limit on #fwhm is saturated, the rate of making detection is 
independent of the overall collecting area, in contrast to the 
case for point sources. We also see that the survey goes from 
being flux density limited to surface brightness temperature 
limited at this saturation point. Given that currently avail- 
able receivers have values of T sys and Af which are close 
physical limits, one sees that the only way to improve the 
detection rate is by increasing rib. We note that the largest 
value of rib is limited by the size of the focal plane of the 
telescope. 

An important consequence of an SZ survey being sur- 
face brightness temperature limited is that, for fixed values 
of T ays and Af, the optimal frequency to observe at would 
be the lowest possible, that is, x ~ 0, where the SZ effect 
is strongest in terms of brightness temperature. In reality, 
the best available values for T ays and Af also are dependent 
on / (typically the best available instantaneous sensitivity, 
Tsya/^/A/, is a very weak function of frequency), and this is 
also mitigated by, both, the fact that point source confusion 
is greatest at the lowest frequency, and probably more sig- 
nificantly, that achieving the saturation resolution requires 
a larger diameter telescope for lower frequencies. Nonethe- 
less, it questions the prejudice that the best frequency at 
which to design an instrument to perform a blind SZ search 
is near the maximum in the flux density of the SZ effect at 
/ w 150GHz. 

In Fig. 6 we have plotted (18) as a function of Th m and 
Afwhm assuming that one can tune any beam smoothing to 
the size of any cluster larger than the beam size, that is, 
Nb can take any value between 1 and oo (that is, #fwhm = 
oo) and using z max = 1.5. The computed values have been 
normalized so that the maximum value is one, and hence the 
precise value plotted quantifies the efficiency of a particular 
observational setup. 

We see that there is a degenerate line with Th m = 4T' CT ss 
55/*K/(0fwhm/1') for #fwhm less than the saturation res- 
olution of 2', which can also be phrased in terms of the 
integrated ry-distortion limit Yn m ~ 2 x 10 -5 /(#fwhm/1')- 
Therefore, we have shown that for sufficiently high resolution 
surveys, it should be possible to perform a survey which is 
optimal in the sense of finding the largest number of clusters 
for a given receiver sensitivity. For lower resolution surveys, 
there is still an optimum for a given value of 0fwhmj but 
this would be sub-optimal relative to performing a survey 
with the same receivers, but using a telescope with higher 
resolution. 

It is simple to understand the degenerate line in Fig. 6: 
one has to achieve a given brightness temperature sensitiv- 
ity at the saturation resolution of 2' in order to detect the 
clusters with masses ~ 10 14 Mq which it is most efficient 
to detect. The required sensitivity at a higher resolution is 
that, which when smoothed, is optimal at the saturation 
resolution, that is, Tii m (9FWHM oc S\i m f $fwhm is resolution 
independent. This corresponds to it being independent of 
JVb as discussed earlier and this is an important property of 
a single dish, total power measurement. 

We have done the same calculation, but this time allow- 
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"PWHM 


/ = 15GHz 


30GHz 


90GHz 


150GHz 


0.5' 


35 


140 


870 


1030 


1' 


70 


280 


1730 


2030 


2' 


160 


620 


3830 


4500 


4' 


480 


1870 


11520 


13470 


8' 


1500 


5810 


35690 


41620 



Table 1. The optimum limiting flux densities (in £iJy) for differ- 
ent set ups, which correspond to the maxima of the graphs curves 
in Fig. 8. We have assumed the idealized total power setup al- 
lowing for beam smoothing described in section 4. The numbers 
are quoted to the nearest 10/^Jy except for $fwhm = 0.5' and 
/ = 15GHz where the flux is quoted to the nearest 5/jJy. 



/lmin 

"fwhm 


/ = 15GHz 


30GHz 


90GHz 


150GHz 


0.5' 


230 


225 


185 


110 


T 


110 


110 


90 


55 


2' 


60 


60 


50 


30 


4' 


45 


45 


40 


25 


8' 


35 


35 


30 


20 



Table 2. The thermodynamic temperature limits (in fiK) quoted 
which correspond to the flux density limits presented in Ta- 
ble 1 to the nearest 5fiK. Note that these can also be com- 
puted approximately from T™ ERM ~ 55/xK|s(x)|/(0 FWH m/1') 
for Spwhm < 2'. 



ing for no beam smoothing and this is presented in Fig. 7. 
The result is very different to that in Fig. 6. There is no 
degenerate line and the optimal configuration is localized 
around (9fwhm ~ 2'. This is because smaller beams find it 
more difficult to detect objects which are larger than the 
beam, when no beam smoothing is allowed. 

We have also computed the optimal configuration in 
terms of the flux density limit 5n m which will, in contrast 
to the non-standard temperature units as defined above, be 
dependent on the observation frequency. This is plotted in 
Fig. 8 for / = 15 GHz, 30 GHz, 90 GHz and 150 GHz and 
a range of different values of #Pwhm- The different values 
of 6>fwhm correspond to slices of constant #fwhm through 
the contour plot in Fig. 6. We see that in each case there is 
a maximum in the curve at a particular value of Sum and 
these are tabulated in Table 1 for various possible configu- 
rations. Note that once again those with Afwhm > 2' are 
seen to be non-optimal. The corresponding thermodynamic 
temperature limits are presented in Table 2. 

The existence of an optimum at S — S op t can be un- 
derstood by considering N(> S) oc S~ a ^ s \ the number of 
clusters detected with a flux limit per beam greater than S. 
It is clear that for S <C Sopt the function a(S) < 2 due to 
the properties of the mass function and for S 3> S opt we 
have that a > 2 since the detection of objects very much 
larger than the beam is very inefficient. S op t is defined by 
c(Sopt) — 2 and it is clear that it will increase with the 
beamsize. 
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Figure 8. The relative number of clusters with z < 1.5 that 
would be found in a survey with / = 15GHz (bottom left), / = 
30GHz (bottom right), / = 90GHz (top left) and / = 150GHz 
(top right). In each case $fwhm = 0-5' (short dashed line), 1' 
(dotted line), 2' (solid line), 4' (dashed-dotted line) and 8' (long 
dashed line). The optimal survey strategy would be to choose 



normalized by the largest possible number of clusters that could 
be found in survey with that frequency and hence we see that 
surveys with ^p^jjj^ > 2' are not optimal. The curves represent 
a slice through the contour plot in Fig. 6 expressed in flux density 
units which naturally introduces a dependence on /. 



5 MORE REALISTIC OBSERVATIONS 

5.1 Systematics in total power measurements 

The ability to perform beam smoothing is not a given, but 
we have shown that it is an essential feature of an SZ survey 
instrument with resolution < 2'. It appears that a number 
of ground based single dish experiments have been able to 
make accurate measurements of the angular power spectrum 
of the primary CMB anisotropics over a wide range of scales. 
However, the reliable detection of extended objects like clus- 
ters requires the maps to not only have the power distributed 
correctly, but also requires the phases to be accurate. 

A variety of instrumental systematics, such as base- 
line offsets caused by the receivers and the atmosphere, 
could lead to artifacts which might destroy the integrity 
of a map once it is smoothed to low resolution. In terms 
of the design of an SZ survey this leads to the possibil- 
ity that #fwhm < oo. In what follows we will assume that 
some beam smoothing is possible, but that this is limited to 
some constant multiple of #fwhm- I n particular, we consider 
the effects of $fwhm = ^fwhm and $fwhm = ^fwhm m 
Figs. 9 and 10. We see that in both cases the degenerate line 
seen in Fig. 6 is truncated at small values of #fwhm (higher 
values for 0j?vvhm ~ 3#J?w HM ). This can be understood by 
realizing that instruments with very small value of Afwhm 
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1.0 

9rwHM[orcmin] 



but this time with # F ^f HM 



Figure 9. Equivalent to Fig. 7, 
^FWHM' There is a degenerate line in the same direction as in 
Fig. 6, but the finite amount of allowed beam smoothing prevents 
very high resolution instruments being efficient for this purpose. 




FWHM[orcmin] 



Figure 10. Equivalent to Fig 
^FWHM' Notice that the degenerate line extends to smaller val- 
ues of $fwhm than in Fig. 9, but that is does not extended to 
zero as in Fig. 7. 



are unable to achieve critical value of 2' by beam smoothing 
when these constraints are placed on #fwhm- We conclude 
from this, that although extra resolution might be desirable 
(for example, to allow subselection within the sample), ex- 
cessive amounts can be damaging to an instrument's survey 
potential. 



5.2 Beam switching 

Beam switching can be used to subtract out the effects of 
the atmosphere in single dish observations when it varies 
on timescales shorter than the characteristic integration 
time. To investigate its effect, we will consider two Gaus- 
sian beams separated by angle 9 S on the sky. It is likely that 
the properties of the atmosphere will prevent 9 S from being 
greater than some value (probably somewhere between 5' 
and 10'), but from the point of view of our discussion we 
will assume that it can take any value between #fwhm an d 

CO. 



First, let us consider a flat sky approximation with co- 
ordinates x and y, and let r e (x,y) be the optical depth to 
the SZ effect along the line of sight parameterized by (x,y). 
If a Gaussian beam, with width denoted by a, is centred at 
the point (x s , 0) then the resultant flux will be proportional 
to 



dx I dy r e (x,y) exp 



1 

2&~ 2 



{(x - x s ) 2 + y 2 ) 



(21) 



where the range of integration is x 2 + y 2 < R 2 ir . The optical 
depth r c (x,y) — r c (0)£(r) is only dependent on the radial 
coordinate r = y^x 2 + y 2 . By making the coordinate change 
x = r cos (j> and y — r sin <j>, one can write this integral as 

' K(r,x s ), (22) 



= (0) 



where 



dr r£(r) exp 



r 

2a 2 



K(r, Xs) = exp 



x s 
2a 2 



d(f> exp 



rx s 



(23) 



If K(r, x s ) = 1 then this expression is that of a Gaussian 
beam observing the centre of the object as used in our dis- 
cussion of idealized total power measurements. This factor 
represents the effects of a non-centred beam. 

One can re-write (23) in terms of the modified Bessel 
function Io(z) and, by considering the difference between a 
beam pointing at the origin and one pointing at x — x s , an 
effective beam B c a{9) can be constructed which represents 
the effects of a beam switch. If we also make the transition 
from the flat-sky coordinate r to the angular coordinate 9, 
factor T(B, Q becomes 



Z(B eS ,Q = 



where 



J dnB cff (9)ge) 

j dttC(0) 



B eS {9) =exp ( -^-5- 



exp 



^2 ) J ( — 



(24) 



.(25) 



In Appendix B, we show that, under sensible assumptions 
as the form of £(0), X(-B e ff,C) ^ as # v ir/#FWHM — > oo, in 
which case it is clear that Mu m —> oo as z — > in contrast to 
the case of a Gaussian beam profile for which Mu m tends to a 
finite value. The reason for this is simple: as one approaches 
z — the clusters become much larger, but the beam switch- 
ing makes it impossible to find objects which are very much 
larger than the switching angle. This phenomenon will play 
a key role in our subsequent discussion. 

We have computed Mu m (z) for #fwhm = 1', / = 30GHz 
and S™™ = 500/iJy for a range of values for 9 S compared 
to that for no switching and the results are presented in 
Fig. 11. At this stage, we ignore the effects of beam smooth- 
ing. We see that for large values of w 10', the limiting 
mass is the same as for no switching when z > 0.5. How- 
ever, as suggested above, we see that for all finite values of 
0s, Mii m — > oo as z — > 0. As 8 S is reduced the value of z 
above which the no switching case is a good approximation 
increases. For 9 S ~ 3' the limiting mass is affected (~ 50%) 
even at z ~ 1; given that the number of clusters found in the 
survey is very sensitive to the limiting mass this can lead to 
the number of clusters detected being significantly reduced. 

The calculation discussed above does not take into ac- 
count beam smoothing which is more subtle than in the case 
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Figure 11. The solid line is the limiting mass of a survey with 
/ = 30GHz, 6>™ HM = 1' and S™^ = SOO^Jy as for that in 
Fig. 2. The other curves are for different values of 8 B . The dotted 
line is 9 B = 2', the short dashed line is 3', the long dashed line 
is 4', the short dashed-dotted line is 5' and the long dashed- 
dotted line is 10'. We see that the effect of beam switching is to 
make the survey less efficient particularly at low redshifts where 
clusters have a much larger angular diameter. None of the curves 
presented here have included any beam smoothing. 



Figure 12. This illustrates the possible effects of beam smoothing 
when beam switching is involved. The curves represent the same 
values of 8 B as in Fig. 11 and the parameters of the survey are 
the same. The solid line is that for no switching and is equivalent 
to the solid line in Fig. 4. We allow beam smoothing up to the 
switching angle; that is, we set 0£?^f HM = 8 B . We see that in each 
case the limiting mass tends to oo as z — ► (see Appendix B). 
For 8 B > 5' the limiting mass is almost equivalent to that for no 
switching except for z < 0.1 where, due to the small comoving 
volume, there are very few clusters available to be detected. 



of a single-dish observation. In the case of switching it will be 
impossible to perform beam smoothing beyond the switch- 
ing angle, that is, 9Jw HM = 8 B . We illustrate the effects of 
this on the limiting mass the same values of /, #fwhm f° r a 
range of values of 8 B in Fig. 12 compared to the case of no 
switching. Once again we see that for all finite values of 9 B , 
Mum — ► oo as z — > 0, but that for 8 B > 4' the limiting mass 
is only significantly different when compared to the case of 
no switching when z < 0.2. Since there are only very few 
clusters to be detected at these low redshifts, we conclude 
that for 8 B > 4' and 0i?WHM — 1' beam switching does not 
affect the number of clusters detected in a given survey. 

Now that we have a method for computing the limiting 
mass in the case of beam switching, we can attempt to inves- 
tigate how to make the optimum choice for the survey depth 
as we did for the single dish, total power case. We will as- 
sume that one can beam smooth up to #fwhm — $s and will 
do this only for the case of #j?WHM = 1' an d / = 30GHz, and 
will work in terms of Sn m (this can be converted into Tn m us- 
ing (17)). Fig. 13 is the equivalent to Fig. 8, but for a range 
of values of 8 B . We see that, so long as the optimal choice 
of fijim is made, the number of objects is only marginally 
reduced for 9 B > 5'. For 8 B = 4' the number of objects found 
is only reduced to about 80% at the optimum, while for 
8 B — 3' one can only achieve about 60% of that possible for 
no switching. The number of objects found is significantly 
reduced for lower values of 8 B . Moreover, we see that the 
optimal value of Sn m is reduced from 280^tJy (see Table 1) 
for no switching to around 200^Jy (T;™ ERM « 80/iK) for 
8 S = 4' and lOO^Jy (T nm « 40^K) for 8 S = 3'. Although 



the optimal value of Su m only changes by about a factor 3 
in going from 8 B = oo to 8 B — 3', the number of objects 
found for larger values of Sii m falls off much more quickly 
than in the no switching case. Both these effects can be ex- 
plained by the fact that it is difficult to detect objects at 
very low redshifts for small values of 8 B : as 8 B decreases one 
has to make surveys which are deeper and deeper in order to 
find anything, and very shallow surveys are unlikely to find 
anything at all since they are only sensitive to the largest, 
very nearby clusters which are effectively resolved out by 
the beam switch. 



5.3 Aspects of interferometric observations 

As we have already pointed, out making total power, single- 
dish measurements are not always possible. Interferometric 
methods using a telescope array provide an alternative to 
this, which can reduce the sensitivity of the measurements 
to a range of systematic effects notably those due to the 
atmosphere. The techniques involved (interferometers work 
in Fourier space, whereas single dishes work in real space) 
are very different. In this section we will discuss how, un- 
der certain circumstances, they can be thought of as being 
a special case of the total power measurements which we 
described in section 4 and we will outline how they can be 
incorporated into our framework. There are definitely some 
caveats to their application to an interferometer which we 
will ignore, for the most part, here. 

Let us consider an array with n r receivers mounted on 
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Figure 13. The relative number of clusters with z < 1.5 that 
would be found in a beam switch surveys with / = 30GHz and 
"fwhm = 1' for different values of the switching angle 8 B . The 
curves are labelled as in Figs. 11 and 12. The solid line is that 
for no switching and the curves are normalized to one at the 
maximum for this at around Sy Im ~ 280/^Jy. As the switching 
angle decreases, searching for clusters becomes less efficient, and 
the optimal limiting flux decreases to around Sn m ~ 100/xJy for 
0s = 3' which is about 60% efficient. 



telescopes each with diameter D, shortest baseline s > D 
and longest baseline d as measured between the centres of 
the dishes which are furthest apart. Such an array will have 
B = n r (n r — l)/2 baselines and can be thought of as having 
a synthesized beam which can be represented by a Gaussian 
with FWHM 9 s F y wHM ~ x / d and 

a primary beam, which 
corresponds to the field-of-view, which is a Gaussian with 
FWHM M w \/D. The shortest baseline sets the scale 
of the largest structure which can be observed. It is our 
claim that in the language of section 4, we can approximate 
the effects of an interferometer by a Gaussian beam with 
$fwhm ~ -V^ an d a maximum beam angle of #fwhm ~ -V s 
assuming that the density of visibilities is uniform in the 
u-v plane, but with two important differences. We note that 
the assumption that the beam is approximately Gaussian 
requires that the number of baselines is sufficiently large 
so that the negative response inevitable in interferometric 
observations is negligible. 

The first important difference is in terms of the sensi- 
tivity of the array. Such an arrangement will have a flux sen- 
sitivity corresponding to a single dish with the same overall 
collecting area oc n T D 2 , but the temperature sensitivity is di- 
luted by the so-called 'filling factor' r/ F = n r D 2 /(D + d) 2 < 1 
(w n r D 2 /d 2 if d > D), such that 



T a = 



(26) 



from which we can see that any survey with a fixed integra- 
tion time should satisfy 



4 log 2 



(27) 



and hence the corresponding instrumental figure of merit is 
given by 



Mi 



T 2 

J-sys 



frpTHERMy 



(28) 



The limit ryp = 1 corresponds to a fully-filled array and is 
equivalent in some sense to a single dish with focal plane 
array of rib = n r elements. We see that an interferometer 
which is fully- filled will be the most efficient. This is not 
particularly important in the context of our discussion here 
since it just means that it will take longer to achieve the 
same temperature sensitivity, or given a fixed integration 
time, one would be able to cover less area to the same depth. 

The other important difference which needs to be taken 
into account can be very significant in terms of the opti- 
mum depth of a survey. The quantity Ti im # F wHM need not 
be independent of resolution for an interferometer, as it is 
for a single dish; this is a consequence of the density of vis- 
ibilities within the u-v plane generated by positions of the 
antennae in the interferometer. We will describe an interfer- 
ometer for which Th m # F wHM is independent of resolution as 
being uniformly filled. 

We can attempt to model an interferometer within our 
framework by assuming that the baseline lengths can be 
grouped into m bins and that there is an approximate res- 
olution #fwhm ~ X/bi corresponding to each of the bins 
1 < i < m, where bi is the average length of the baselines in 
bin i. There should be no significant error introduced by bin- 
ning of the baselines if m sufficiently large. We will assume 
that at each resolution the response of the interferometer 
can be modelled by a Gaussian beam with FWHM # PWHM 
which should be a good approximation so long as the number 
of baselines with length < bi is sufficiently large. By recon- 
structing the beam in this way, we are effectively ignoring 
the data from long baselines. The flux limit corresponding 
to resolution FWHM will be given by 



•Slin 




(29) 



where B l is the number in bins with length < bi. 

There is no particular reason, a priori, 
why Ti; m FWHM oc Sl iln /6 FWlIM should be independent of 
i, which is what would be required for the interferometer to 
be uniformly filled! Given a specific set of baseline lengths, it 
would be simple to compute the overall limiting mass as that 
which corresponds to the envelope of the individual values 
of i. 

One can re-phrase this discussion in terms of u-v plane 
of the interferometer. If u — |u| is the amplitude of the 
Fourier variable u = (it sin <f>, u cos </>), defined in multiples 
of the wavelength A, then F w H m = l/u min and # F whm = 
1/itmax, where w ma x and it m in are the maximum and mini- 
mum values of u. In principle, one can create a beam, which 
we will assume is Gaussian, corresponding to any value 
itmin < u < M max , with FWHM and flux density limit given 
by 



Su 



1/2 



Sni n , (30) 
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Figure 14. A realization of the simple interferometer arrange- 
ment discussed in the test with n r = 6. d and s is the longest and 
shortest baselines respectively. D is the dish size. In this case there 
are three different types of baseline which, in order of increasing 
length have solid, dotted and dashed lines. 



where 

{' u 

B u = 2-7T / u p(u')du . 



(31) 



is the number of visibilities < u and p(u) is the density of 
visibilities. The case of a uniformly filled interferometer can 
then be seen to correspond to p(u) = po, that is, a uniform 
density of visibilities in the u-v plane. 

An interesting parameterization of interferometers in 
this context is to consider p(u) oc u 1 where 7 > 0. For 
u S> u m i„, B u oc u 1+2 and hence 



5 min 
lim 



(32) 



If we choose Nb = (iimax/u) 1 '' 2 in order detect clusters which 
cover A?b beams, then we find that 



y rrniin 
'-'lim 



(33) 



We will explore this parameterization in future work, but it 
is clear that unless 7 = the behaviour of an interferometer 
when smoothing will differ from that of the idealized single 
dish case discussed in the previous section. 

In order to illustrate this further, we will consider a very 
specific arrangement of the telescopes. Let us assume that 
they are arranged in a circular configuration. In order to 
make our discussion simpler we will assume that n r is even, 
in which case there n r /2 different baseline lengths. If these 
baseline lengths are denoted bi for 1 < i < n r /2 then 



bi = dsin 



(34) 



where there are n T of length bi for i < n r /2 — 1 and n r /2 with 
length & 7lr /2 — d. A diagram illustrating the arrangement for 
n T — 6 is presented in Fig. 14. 

The shortest baseline is given by s — dsin[7r/n r ] and 
hence the ratio of the shortest to longest baseline, which is 
is given by 



equal to #fwhm/$fwhm 



(35) 



The largest possible filling factor achievable with such a con- 
figuration is when s — D, that is, when all the dishes in 



<2xl0'« - 




Figure 15. The mass limits for the simple interferometric con- 
figuration discussed int the text with rii = 8. The bottom left has 
PWHM = V and 5 l?m = WO^y, the bottom right 0»» HM = 2' 



°FWHM — 
8' and S,™ m 



4' and 



2mjy, 

and the top right has ^pwHlvl = ^' and 5° = 5.8mjy. In each 
case the solid line corresponds to the equivalent single dish obser- 
vation assuming the maximum filling factor allowed for n r = 8, 
rfp = 0.6, the dotted line to i = 1, the short dashed line to i = 2, 
the long dashed line for i = 3, and the dot-dashed line to i = 4. 
The overall mass limit which would be achieved for each of the 
interferometers would be the envelope of the curves correspond- 
ing to i = 1 to 4. Note that in each case the mass limit for the 
equivalent single dish is better than envelope of those for the cor- 
responding to the interferometer. 



the circular arrangement are touching. In this case r)v = 
n r /(l + cosec(-7r/n r )) 2 which tends to zero like l/n r for large 
n r . 

A range of resolutions are possible between the mini- 
mum and maximum , given by 



A 
b~i 



qmm 
'FWHM ■ 



which result in the limiting flux density of 

Slim z 



2l 



- emu 
-Jlin 



mm 
m 1 



for i < n r /2 — 1 and limiting temperature of 

Tli m = 



2; 



1 . 2 
■ sin 



1 lim 



(36) 



(37) 



(38) 



It is clear that this interferometer is not uniformly filled 
for finite values of n r since 



1 lim^F' 



27 



[ lim 'FWHM j 



(39) 



which is clearly not independent of i. Such a configuration 
is close to being uniformly filled for the longest baselines if 
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n v is large. If we define j = n r /2 — i and assume that n r is 
large then 

2Ti m #FWHM ~ T}Sxi $FWHM + , (40) 

that is, the longest baselines are close to being uniformly 
filled. But even in this case, the filling factor is not uniform 
for small i where the discreteness takes hold. 

We have plotted the mass limits for 1 < i < 4 for 
n r = 8 in Fig. 15 for a range of values of #fwhm along 
with the predictions of a single dish with the same maxi- 
mum resolution. In this case the best filling factor possible 
is 7/f = 8/(1 + cosec(7r/8)) 2 ~ 0.6. The actual mass limit 
that one would achieve with an interferometer is given by 
the envelope of the curves for i = 1 to 4, and it is clear to 
see that one always does worse in terms of detecting clus- 
ters with an interferometer when compared to an ideal single 
dish with the same resolution. We note that despite this ad- 
vantage, systematics are nearly always worse for single dish 
observations. 

The values of have, in each case, been chosen to 

be the values which are close to the optimal survey strategy 
in the case where the interferometer is uniformly filled (but 
this might not be optimal here). There are some interest- 
ing aspects of the problem which are raised by Fig. 15. We 
see that for #fwhm = 1' if z < 1 the mass limit is ostensi- 
bly defined by that for i = 1 corresponding to the shortest 
baselines, whereas for #fwhm = 8' it is that for i = 4 cor- 
responding to the longest baselines which dominates except 
for the very smallest values of z. 



6 ARE OUR PREDICTIONS ROBUST? 
6.1 Cosmological parameters 

In making our calculations we found it necessary to make 
choice of cosmological parameters. It is unlikely that the 
total number of clusters found in a given survey is very 
strongly dependent on the parameters h, u>o and wi al- 
though how they are distributed as a function of z should 
be. However, it is known that it will be strongly depen- 
dent on as and fl m . We have made sensible choices for these 
parameters in the light of presently available cosmological 
observations, but it is perfectly possible that our choices are 
incorrect. The value we have used, erg = 0.9, is that preferred 
by measurements of the primary anisotropies in the CMB 
and weak lensing. However, measurements of the X-ray lu- 
minosity function of clusters and the gas fraction therein 
suggest that erg w 0.7 (Allen, Schmidt & Fabian 2002). 

In light of this discrepancy, we have performed our cal- 
culations for a range of different values of as between 0.7 and 
1.1. We find that there is a degenerate line in the equivalent 
of Fig. 6 for every value of as we tried and that a survey 
can be optimal for #fwhm < 2'. However, the normalizing 
amplitude is modified by erg . We find that 

T lim = 55 M K(^|) 41 , (41) 

at #fwhm = 1'. The strong dependence on erg is due to the 
exponential dependence of the number of clusters on it. 

We have also investigated the effects of variations in 
f2 m . Fig. 16 presents the equivalent of Fig. 6 for Q m = 0.5. 




8FWHu[arcmin] 



Figure 16. Equivalent to Fig. 6 for n m = 0.5. There is the 
characteristic degenerate line, but with an amplitude of Tn m 
SOfiK at #fwhm = 

Yet again there is a degenerate line for #fwhm < 2' and for 
this case the amplitude for #fwhm = 1' is Tn m « 80^K. 

6.2 Cluster model parameters 

Another possible uncertainty of our calculation is the cluster 
model which we have used. The two important features of 
the model in this respect are the M-T relation (3) and the 
cluster profile function. 

We have investigated the effects of varying the param- 
eter T*, although we note that it is perfectly possible that 
the relation between M and (T ) n as a function of z is not 
precisely of the form (3). In this case we find that the nor- 
malizing amplitude at Sfwhm = 1' is given by 

T lim = 55 M K , (42) 

with the degenerate line and saturation value of #fwhm as 
before. It is easy to understand this linear dependence of 
T* since modifying it just leads to a corresponding modi- 
fication of the temperature of each cluster. It is clear that 
modification to the relationship between M and (T e ) and to 
the z dependence can led to more complicated modifications 
to the optimal value of Th m dependent on how this affects 
the overall distribution of clusters. Once more information is 
available on this it might be worth investigating this further. 

We have also varied the parameters of the isothermal f3- 
model, a and f3. For a general value of (3 the profile function 
is given by 

where 

pa 

J(a,b) = / {l + x 2 )-^ dx. (44) 
Jo 

Fig. 17 shows the effect of varying a = 5, 10, 50 and 100 
and P = 1/3, 2/3, 1 and 4/3 for a survey with 6>fwhm = 1' 
and / = 30GHz. We see that the optimal value of Sii m varies 
only by about a factor of 2 when a varies by a factor of 20 
between 5 and 100. We note also that larger values of a lead 
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10 100 1000 10 4 10 100 1000 10 4 

s n m /MJy s, im /MJy 

Figure 17. The effect of varying the cluster model parameters 
a and f} for $fwhm = 1' an d / = 30GHz. On the left we vary 
a, while keeping /3 = 2/3. The dotted line is a = 5, solid line is 
a = 10, dashed line is a = 50 and dot-dashed line is a = 100. 
On the right we vary /3, with a = 10. The dotted line is /3 = 1/3, 
the solid line is j3 = 2/3, dashed line is j3 = 1 and dot-dashed 
line is f} = 4/3. In each case the curves are normalized in such 
a way that the maximum value of Ntot for a = 10 and f3 = 2/3 
is one. The solid lines are the equivalent to that in the bottom 
right of Fig. 8. We see that the optimum value of Sii m is relatively 
independent of a, but appears to change significantly as we vary 
/3. In particular the optimum value is Su m = 280^tJy for /3 = 2/3 
and is ~ lOOO^tJy for f3 = 4/3. 

to more clusters detected; about a factor of 2 between a = 5 
and 100. This is because for larger values of a the clusters 
are more point-like, making a given mass easier to detect. 

The variations as a function of /3 are more significant. 
The optimal value of 5n m for /3 = 2/3 is 280/^Jy (see Table 
1) whereas for f3 = 4/3 it is ~ lmjy. Moreover the total 
number of clusters increases by a factor ~ 10 in going from 
f3 = 1/3 to j3 = 4/3. As before this is because as /3 in- 
creases the cluster becomes more point-like: for j3 = 1/3, 
the electron number density n c oc r~ , whereas for (3 — 4/3, 
n c oc r -4 . 




s„ m /MJy 

Figure 18. The effect of different mass functions for 0FWHM = 1' 
and / = 30GHz. The solid line is the standard case of the Evrard 
et al (2002) mass function used in the earlier discussion, notably 
Fig. 8. All the other surveys have been normalized to the number 
of objects which would be found relative to this mass function. 
The dotted line is for Jenkins et al (2001), long dashed line is for 
ST with M = Meo and the short-dashed line is for PS with M = 
M200- The optimum value appears to be relatively independent 
of the mass function, but the overall number of clusters varies by 
about a factor of 2. 

In Fig. 18 we have plotted the same quantity as in 
Fig. 8 for #fwhm = 1' and / = 30GHz using the mass 
function already used (Evrard et al. 2002) which is based 
on VIRGO consortium simulations, their alternative mass 
function (Jenkins et al. 2001), the ST (Sheth & Tormen 
1999) mass function using M = M^o and the PS (Press & 
Schechter 1974) mass function using M = M200. We see 
that the maximum of the curve, and hence the optimal sur- 
vey depth is almost independent of the mass function within 
the bounds of those functions which we have investigated. 
Rather worryingly, the overall number of clusters found for 
a given flux limit varies by a factor of two for the differ- 
ent mass functions. This point was highlighted in Battye & 
Weller (2003), and it is clearly something which needs fur- 
ther investigation. Nonetheless, we conclude that uncertain- 
ties in the mass function are not significant when considering 
the optimal depth of a survey. 



6.3 Mass function 

The final major aspect of our calculation which could con- 
tain systematic errors is the comoving number of density 
of objects dn/dM which is computed from N-body simula- 
tions (Evrard et al. 2002). It is clear that the simulations 
used, while the best available at present, only have limited 
dynamical range, and the fitted formula for dn/dM could be 
inaccurate at the extreme values of M, both large and small. 
In order to investigate the possible effects of this on our cal- 
culation we have tried three alternative mass functions in 
addition to the one used in the rest of the paper. 



7 CONCLUSIONS 

We have discussed in detail the optimal depth of an SZ clus- 
ter survey and have as a by-product gained some insight in 
the optimal experimental configuration. Ignoring the effects 
of confusion noise, for every beamsize considered we have 
shown that there exists an optimum depth at which one will 
find the largest number of clusters for a fixed integration 
time. The number of clusters found as a function of the flux 
density or temperature limit is typically quite broad, which 
is fortunate since the calculations we have made contain 
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many uncertainties. We have shown that the most signifi- 
cant of these are those relating to the M — T relation and 
the cluster profile. Even within the known uncertainties in 
these parameters it should be possible to get within 50% of 
the optimum. 

For an instrumental setup with # F whm < 2' and as- 
suming that beam smoothing is possible, one is brightness 
temperature limited and it is inefficient to make very deep 
surveys searching for clusters which match a much smaller 
beam. One is always in a position to perform shorter integra- 
tions on many small beam areas and smooth them down to 
the lower resolution where the maximum number of clusters 
are available. These conclusions are particularly sensitive 
to some of the more sophisticated experimental techniques 
which are used to counteract the effects of the atmosphere. 
We have discussed two: in the case of beam switching, we 
have shown that one is not severely affected if the beam 
switch is not so small as to prevent one detecting the most 
abundant clusters which are which have a virial diameter of 
~2'. 

The discussion of the optimal design of an interferom- 
eter and the survey strategy is complicated by the extra 
freedom due to density of visibilities in the u-v plane. We 
have discussed how our methods can be applied in that case, 
but, other than asserting that the most filled interferometer 
would have the optimal design, we have not made any state- 
ments on the optimal strategy. Given a particular situation 
it might be possible to do this in more detail. 

We hope that our calculations can be used to assist ob- 
servers in their choice of survey strategy. We acknowledge 
numerous times that our predictions are model dependent. 
One could imagine a method by which one could search 
adaptively for the optimum depth of a survey as part of 
the process of performing the observation. The basic idea is 
to take some small patch of sky, compared to the potential 
total survey volume, and compute the number of sources as 
a function of Sn m . When the logarithmic gradient of this 
quantity is greater than -2, the optimum has been achieved, 
and that value of Sn m should be used on the rest of the 
survey area. Clearly, this will only be possible in scenarios 
where one is expected to find a large number of objects since 
one will have to find a sufficiently large number in the ini- 
tial test area to make the estimate of the gradient accurate 
enough. 
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Appendix A 

Let us consider computing the integral I(B, when £(#) = 
F(y) where y = 9 2 /<9 2 ir for 9 < <9 vir and zero for 9 > 9 V1I . 
The function is defined so that F(0) = 1. It is easy to see 
that for the isothermal /3-model with (3 = 2/3 



F(y) = (i 



-1/2 



tan 



^(i-y 2 ) 1/2 



tan 



(45) 



and it is likely that most sensible profile functions will have 
this property. 

If we define R = 4(# v ir / #fwhm) 2 log 2 then 



J[ e- R yF(y)dy 

So F(y)dy 



(46) 



One can see that in the limit R — > the function X — » 1 
which corresponds to the point source limit. The opposite 
limit, R — > co, is more interesting since it corresponds to 
that when the cluster is much larger than the beam. One 
can show that the leading order term in the asymptotic ex- 
pansion in 1/R is 



Jo 



R / F(y) dy 



(47) 



From (8) we see that S oc {T c ) n M v i T Id A 2 and since R v i r = 
9 v i T d\ we can deduce that 



S<x 



(r c )„M vir 



(48) 



which is finite as z — > 0. Hence, we see that the limiting 
mass tends to a finite value at z — in contrast to the case 
of a point source. 

Appendix B 

Let us consider I(B e ff,Q for B e g(9) given by that for a 
switch beam system (25). We will assume that F{y) and 
R are defined as in Appendix A, and will define Q = 
4(# s /#fwhm) 2 log 2. Using these definitions we find that 

= ^[l-e-«/ (2VB^] F {y) dy 
J F(y)dy 

Understanding the asymptotic behaviour of this integral is 
more difficult than in the standard case. In order to make 
some progress let us write F(y) as a power series 



F( y ) = J2 A ™y r: 



(50) 



where Ao = 1 and A\ < 0. From this we can deduce that 
, Am (^B m e ® Cm) 



l(B cB ,C) = - m= " 



IoF(y)dy 



where 

B m - 



f 

Jo 



— Ry m 

e y y c 



C m = J e- Ry I (2^RQy^y m dy. 



(51) 



(52) 



(53) 



One can write C m in terms of B m by writing Io(x) as a 
power series in x, 

1 



l\2 



(RQ) n B„ 



(54) 



Moreover, it can be shown that B m ~ m\/R m+l and hence 
Co ~ e Q /R and Ci ~ (Q + l)e Q /R 2 . From this we see that 
in the asymptotic expansion, the first term in (51) cancels 
and therefore 
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l(B eS X) ~ ~ Al ® (55) Viana P., Liddle A.R., 1996, MNRAS 281, 323 

R2 Io F (y) d v 

This cancellation is enough to ensure that X — ► as z — > 
sufficiently fast so that 

Soc^"f^, (56) 

in this limit. It is easy to see that this forces the limiting 
mass to tend to infinity as z — > 0. 
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